
Docket No.: 428.1056 
UNITED STATES PATENT & TRADEMARK OFFICE 



Examiner: Unknown Art Unit: Unknown 

Applicant: KM, Young Tae, et al. 

Serial No.: 10/551,508 

Filed September 29, 2005 

For: GENE INVOLVED IN THE BIOSYNTHESIS 



OF CAROTENOH) AND MARINE 
MICROORGANISM, PARACOCCUS 
HAEUNDAENSIS, PRODUCING THE 
CAROTENOH) 

RESPONSE TO NOTICE TO FILE MISSING PARTS 

Commissioner for Patents June 5, 2006 

P.O. Box 1450 
Alexandria, VA 22313-1450 



Sir: 

In response to the Notice to File Missing Parts ("Notice") dated May 10, 2006, enclosed herewith 
is an executed Declaration. As indicated in the Notice, the previous payment will be applied to the 
requisite surcharge of $65.00 (at Small Entity). 

The Notice indicated that the originally submitted Computer Readable Form copy of the 
Sequence Listing does not comply with the requirements of 37 CFR 1.822 and/or 1.823, because the 
original copy contains informalities, for example, misaligned amino acid numbering. 

A substitute Computer Readable copy of the Sequence Listing is enclosed herewith, together with 
the paper copy of the Sequence Listing. The misaligned amino acid numbering has been corrected. The 
substitute Sequence Listing also provides the application serial number and the filing date. No other 
changes have been made to the Sequence Listing, and the required Declaration to that effect is enclosed. 

Pursuant to 37 C.F.R. §1 .136(a)(3), please treat this and any concurrent or future reply in this 
application that requires a petition for an extension of time for its timely submission as incorporating a 
petition for extension of time for the appropriate length of time. The fee associated therewith is to be 

06/08/2006 HKAYPA6H 00000059 022275 10551506 

01FC:2617 65.00 Dft 1 



J- 



charged to Deposit Account No. 02-2275 . 



Any deficiency or overpayment should be charged or credited to Deposit Account No. 02-2275 
A duplicate copy of this sheet is enclosed. 



Respectfully submitted, 
LUCAS & MERCANTI, 



LUCAS & MERCANTI, LLP 
475 Park Avenue South 
New York, New York 10016 
Phone: 212-661-8000 
Fax: 212-661-8002 













Michael N?1vlercanti 1 





Reg. No. 33,966 



"Express Mail" mailing label no. EV 799 378 378 US 
Date of Deposit June 5. 2006 

I hereby certify that this correspondence and/or fee is being deposited 
with the United States Postal Service "Express Mail Post Office to 
Addressee" service under 37 CFR 1 .10 on the date indicated above, in an 
envelope addressed to: "Commissioner for Patents, P.O. Box 1450 
Alexandria, VA 223 1 3-1450". 
LUCAS & MERCANTI, LLP 



By: 

Carla Santos 
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United States FXtent and Trademark Office 



UNITED STATES DEPARTMENT OF COMMERCE 
United Status Pntutit and Trademark OfTiixj 
Addrw: COMMISSIONER FOR PATENTS 
P.O. Box H50 

Alcxandrin, Vuj?inio 223 13-1450 
www.usplo.gov 



U.S. APPLICATION NUMBER NO. 



FIRST NAMED APPLICANT 



ATTY. DOCKET NO. 



10/551,508 



Michael N Mercanti 
475 Park Avenue South 
New York, NY 10016 



Date Mailed: 05/10/2006 



RECEIV 



THANK9 




Tae KIM 




MAY 1 5 2006 



LUCAS & MERCANTI LLP 



428.1056 



INTERNATIONAL APPLICATION NO. 

PCT/KR04/00752 



] 



| I.A. FILING DATE | PRIORITY DATE | 



03/31/2004 



03/31/2003 



CONFIRMATION NO. 8016 
371 FORMALITIES LETTER 

II 








*OC000000018713342* 



NOTIFICATION OF MISSING REQUIREMENTS UNDER 35 U.S.C. 371 IN THE UNITED 

STATES DESIGNATED/ELECTED OFFICE (DO/EO/US) 

The following items have been submitted by the applicant or the IB to the United States Patent and Trademark 
Office as a Designated / Elected Office (37 CFR 1 .495). 

Indication of Small Entity Status 

Copy of the International Application filed on 09/29/2005 
Copy of the International Search Report filed on 09/29/2005 
Preliminary Amendments filed on 09/29/2005 
Biochemical Sequence Diskette filed on 09/29/2005 
Oath or Declaration filed on 09/29/2005 
Biochemical Sequence Listing filed on 09/29/2005 
Request for Immediate Examination filed on 09/29/2005 
U.S. Basic National Fees filed on 09/29/2005 



i he applicant needs to satisfy supplemental fees problems indicated below. 

The following items MUST be furnished within the period set forth below in order to complete the requirements for 
acceptance under 35 U.S.C. 371 : 



Oath or declaration of the inventors, in compliance with 37 CFR 1.497(a) and (b), identifying the application 
by the International application number and international filing date. The current oath or declaration does 
not comply with 37 CFR 1 .497(a) and (b) in that it: 

■ is not executed in accordance with either 37 CFR 1 .66 or 37 CFR 1 .68. 

To avoid abandonment, a surcharge (for late submission of filing fee, search fee, examination fee or oath 
or declaration) as set forth in 37 CFR 1 .492(h) of $65 for a small entity in compliance with 37 CFR 1 .27, 
must be submitted with the missing items identified in this letter. 



SUMMARY OF FEES DUE: 
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Total additional fees required for this application is $-115 for a Small Entity: 
• $65 Surcharge. 

r. 

A copy of the "Sequence Listing" in computer readable form has been submitted. However, the content of 
the computer readable form does not comply with the requirements of 37 CFR 1 .822 and/or 1 .823, as 
indicated on the attached copy of the marked -up "Raw Sequence Listing." Applicant must provide a 
substitute computer readable form (CRF) copy of the "Sequence Listing" and a statement that the content 
of the sequence listing information recorded in computer readable form is identical to the written (on paper 
or compact disc) sequence listing and, where applicable, includes no new matter, as required by 37 CFR 
1*821(e), 1.821(f), 1.821(g), 1.825(b), or 1.825(d). 

(A previous payment of $180 will be applied to the additional fees indicated above.) 

Applicant is cautioned that correction of the above items may cause the specification and drawings page count to 
exceed 100 pages. If the specification and drawings exceed 100 pages, applicant will need to submit the required 
application size fee. 

For questions regarding compliance to 37 CFR 1.821-1.825 requirements, please contact: 

• For Rules Interpretation, call (571) 272-0951 

• For Patentin Software Program Help, call Patent EBC at 1-866-217-9197 or directly at 703-305-3028 / 
703-308-6845 between the hours of 6 a.m. and 12 midnight, Monday through Friday, EST. 

• Send e-mail correspondence for Patentin Software Program Help @ ebc@uspto.gov 

ALL OF THE ITEMS SET FORTH ABOVE MUST BE SUBMITTED WITHIN TWO (2) MONTHS FROM THE c&j 
DATE OF THIS NOTICE OR BY 32 MONTHS FROM THE PRIORITY DATE FOR THE APPLICATION, 
WHICHEVER IS LATER. FAILURE TO PROPERLY RESPOND WILL RESULT IN ABANDONMENT. 

The time period set above may be extended by filing a petition and fee for extension of time under the provisions 
of 37 CFR 1.136(a). 

Applicant is reminded that any communications to the United States Patent and Trademark Office must be mailed 
to the address given in the heading and include the U.S. application no. shown above (37 CFR 1 .5) 

A copy of this notice MUST be returned with the response. 

FRANCINE YOUNG 



Telephone: (703) 308-9140 EXT 215 



PART 1 - ATTORNEY/APPLICANT COPY 











| U.S. APPLICATION NUMBER NO. | 


INTERNATIONAL APPLICATION NO. 


| ATTY. DOCKET NO. 


I 



10/551,508 PCT/KR04/00752 428.1056 



FORM PCT/DO/EO/905 (371 Formalities Notice) 



* * 



RECEIVED 



STIC Biotechnology Systems Branch 

RAW SEQUENCE LISTING 
ERROR REPORT 



KAY S O?- 



LUCAS & ME£C AWTl LLP 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: lOjSSj^ ^> QST 

Source: ^^fT 
Date Processed by STIC: / b I flL / 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDED A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.2,2 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker/chkrnote.htm 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment givento all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD- Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm> » EFS Submission 

User Manual - ePAVE) 

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, V A 22313-1450 

3. Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE 01/14/05): 

U.S. Patent and Trademark Office, Mail Stop Sequence, Customer Window, Randolph Building, 401 Dulany Street, 
Alexandria, V A 22314 

Revised 01/24/05 
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8 



11 
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Invalid Line Length ;ifhe rules require that a line not weed character^ in .length. This includes : white spaces.* 




^Misaligned Amino The numbering under each 5* aimnUacid is misaligned. Do not use tab codes between numbers; 
Numbering use space characters, instead, .. 



Non-ASCII 



Variable Length 



Palentln 2.0 
"bug" 



The submitted file was not savedut ^SCII(DOS) text; as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text 

Sequences) contain n's or Xaa*s representing more than one residue. Per Sequence Rules 
each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version.2.0 has caused the <220>-<223> section to be missing from amino acid 

sequencers) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. TTiis applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 




Skipped Sequences Sequences) missing. If intentional, please insert the following lines for each skipped sequence: 

"(OLD RULES) (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 

(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
(xi) SEQUENCE DESCRIPTIONrSEQ ID NO:X: (insert SEQ ED NO where W is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES " response to include the skipped sequences. 

missing. If intentional, please insert the following lines for each skipped sequence 



^Skipped Sequences Sequence(s) 

"(NEW RULES) <210> sequence id number 

<4l)0> sequence id number 

000 

Use of n's or Xaa's Use of n's and/or Xaa's have been detected in the Sequence Listing. 

"(NEW RULES) Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa'sare present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 



10 Invalid <2 1 3> 

Response 



Use of<220> 



Patentln 2.0 
"bug" 



Misuse of n/Xaa 



Per 1 .823 of Sequence Rules, the only valid <21 3> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <2 1 3> response is Unknown or 
is Artificial Sequence 

Sequences) missing the <220> "Feature'* and associated numeric identifiers and responses. 

Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequenced 

"Unknown." Please explain source of genetic material in <220> to <223> section. 

(See "Federal Register," 0670 1/1 998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk** function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager** or any other manual means to copy file to lloppy disk. 

"n" can only represent a single nucleotide : Xaa" can only represent a single amino acid 

AMC - Biotechnology Systems Branch - 09/09/21X13 
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RAW SEQUENCE LISTING DATE: 10/14/2005 

PATENT APPLICATION: US/10/551,508 TIME: 11:04:23 

Input Set : N:\FANTU\10551508.raw.txt 
Output Set: N:\CRP4\10142005\J551508.raw 

3 <110> APPLICANT: KIM, Young Tae 

4 LEE, Jae Hyung 

7 <120> TITLE OP INVENTION: Gene involved in the biosynthesis of carotenoid and marine 

8 microorganism, paracoccus haeundaesis, producing the 

9 carotenoid 

11 <130> FILE REFERENCE: 428.1056 

13 <140> CURRENT APPLICATION NUMBER: US/10/551,508 
13 <141> CURRENT FILING DATE: 2005-09-29 

13 <150> PRIOR APPLICATION NUMBER: PCT/ KR2 004/000752 

14 <151> PRIOR FILING DATE: 2003-03-31 

16 <150> PRIOR APPLICATION NUMBER: KR2003 -20222 

17 <151> PRIOR FILING DATE: 2003-03-31 

19 <150> PRIOR APPLICATION NUMBER: KR2003-20023 

20 <151> PRIOR FILING DATE: 2003-03-31 
22 <160> NUMBER OF SEQ ID NOS: 18 

24 <170> SOFTWARE: Kopatentln 1.71 



ERRORED SEQUENCES 

52 <210> SEQ ID NO: 3 

53 <211> LENGTH: 1454 

54 <212> TYPE: DNA 

55 <213> ORGANISM: Paracoccus haeundaesis 

57 <400> SEQUENCE: 3 

58 caacttgaga gtttgatcct ggctcagaac gaacgctggc 
60 gtcgagcgag accttcgggt ctagcggcgg acgggtgagt 
62 tctctacgga atagccccgg gaaactggga gtaataccgt. 
64 atttatcgga gaaggatcgg cccgcgttgg attaggtagt 
66 aagccgacga tccatagctg gtttgagagg atgatcagcc 
68 cccagactcc tacgggaggc agcagtgggg aatcttagac 
70 agccatgccg cgtgagtgat gaaggcctta gggttgtaaa 
72 aatgacggta ccagcagaag aagccccggc taactccgtg 
74 gagggggcta gcgttgttcg gaattactgg gcgtaaagcg 
76 tcagaggtga aatcccaggg ctcaaccttg gaactgcctt 
78 tcgagagagg tgagtggaat tccgagtgta gaggtgaaat 
80 accagtggcg aaggcggctc actggctcga tactgacgct 
82 gcaaacagga ttagataccc tggtagtcca cgccgtaaac 
84 aagcatgctt gtcggtgtca cacctaacgg attaagcatt 
86 caagattaaa actcaaagga attgacgggg gcccgcacaa 
88 attcgaagca acgcgcagaa ccttaccaac ccttgacatg 
90 cagctttctc gtaagagacc tgcacacagg tgctgcatgg 
92 gagatgttcg gttaagtccg gcaacgagcg caacccacgt 



.-en** 



ggcaggctta 
aacgcgtggg 
atacgccctt 
tggtggggta 
acactgggac 
aatgggggca 
gctctttcag 
ccagcagccg 
cacgtgggcg 
tgaaactatc 
tcgtagatat 
gaggtgcgaa 
gatgaatgcc 
ccgcctgggg 

gcggtggagc 

gcaggaccgc 
ctgtcgtcag 
ccctagttgc 



1 



acacatgcaa 
aacgtgccct 
tgggggaaag 

atggcccacc 
tgagacacgg 
accctgatct 
ctgggaagat 
cggtaatacg 
gactggaaag 
agtctggagt 
tcggaggaac 
agcgtgggga 
agacgtcggc 
agtacggtcg 
atgtggttta 
tggagagatt 
ctcgtgtcgt 
cagcattcag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



fiIe://C:\CRF4\OUTHOLD\VsrJ551508.htm 
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!--> 



B--> 



RAW SEQUENCE LISTING DATE: 10/14/2005 

PATENT APPLICATION: US/10/551,508 TIME: 11:04:23 

Input Set : N:\FANTU\10551508.raw.txt 
Output Set: N:\CRF4\10142005\J551508.raw 

94 ttgggcactc tatggaaact gccgatgata agtcggagga aggtgtggat gacgtcaagt 1140 
96 tctcatggcc cttacgggtt gggctacaca cgtgctacaa tggtggtgac agtgggttaa 1200 
98 tccccaaaag ccatctcagt tcggattgtc ctctgcaact cgagggcatg aagttggaat 1260 
100 cgctagtaat cgcggaacag catgccgcgg tgaatacgtt cccgggcctt gtacacaccg 1320 
102 cccgtcacac catgggagtt ggttctaccc gacgacgctg cgctaacctt cggggggcag 1380 
104 gcggccacgg taggatcagc gactggggtg aagtcgtaac aaggtagccg taggggaacc 1440 



106 tacoqctgga tcac 
107 

325 ^2T0> SEQ ID NO: 5 

326 <211> LENGTH: 729 

327 <212> TYPE: DNA 

328 <213> ORGANISM: crtW gene 

330 <400> SEQUENCE: 5 

331 atgagcgcac atgccctgcc caaggcagat ctgaccgcca ccagcctgat cgtctcgggc 
333 ggcatcatcg ccgcgtggct ggccctgcat gtgcatgcgc tgtggtttct ggacgcggcg 
335 gcgcatccca tcctggcgat cgcgaatttc ctggggctga cctggctgtc ggtcggtctg 
337 ttcttcatcg cgcatgacgc gatgcacggg tcggtcgtgc cggggcgtcc gcgcggcaat 
339 gcggcgatgg gccagctggt cctgtggctg tatgccggat tttcgtggcg caagatgatc 
341 gtcaagcaca tggcccatca ccgccatacc ggaaccgacg acgaccccga tttcgaccat 
343 ggcggcccgg tccgctggta cgcgcgcttc atcggcacct atttcggctg gcgcgagggg 
345 ctgctgctgc ccgtcatcgt gacggtctat gcgctgatcc tgggggatcg ctggatgtac 
347 gtggtcttct ggccgctgcc gtcgatcctg gcgtcgatcc agctgttcgt gttcggcacc 
34 9 tggctgccgc accgccccgg ccacgacgcg ttcccggacc gccataatgc gcggtcgtcg 
351 cggatcagcg accccgtgtc gctgctgacc tgctttcact ttggtggtta tcatcacgaa 
353 caccacctgc acccgacggt gccttggtgg cgcctgccca gcacccgcac caagggggac 

355 accgcatga 

356 <@ 

359 <210> SEQ ID N0^6 t 

i 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 




E--> 



He Val Ser Gly Gly He He Ala Ala Trp Leu Ala Leu His Val His 



E- - > 



360 <211> LENGTH: QA 

361 <212> TYPE: PRT 

362 <213> ORGANISM: crtW amino acid 

364 <400> SEQUENCE: 6 

365 Met Ser Ala His Ala-> Leu Pro Lys Ala Asp Leu Thr Ala Thr SerJ,e'u 

366 1 W> [O @) IS - 
368 

-> 369 

371 Ala Leu Trp. ghf..~Leu Asp Ala Ala Ala His Pro He Leu Ala, lie Ala 

372 * s@>:. 4© q<r { 1 ; :<tf> ■ 

374 Asn Phe Leu Gly Leu Thr Tro^Leu Ser VaJ. Gly Leu Phe Phe' He Ala 

375 ~ (p ' ' . ' & 6© 

377 His AsEuAla Met His Gly Ser Val Val Pro Gly^fcrg. J?ro Arq Gly As 

378 ^ 9©(£oJ> 

380 Al<a Ala Met Gly Gin Leu Vai Leu Tip Leu Tvr Ala Gly Phe Ser^Tro 

381 tKt$£> €fD * \{£> * * . ' 
383 Arg Lys Me»t He Val Lys His. Met Alajiis, His Ar.g His Thr Glv, T 

b--> 384 \Qftf$C~ fp^.Voy - li^ P ,r 

386 Asp Asp Asp P^o Asp. Php Asp His GlyGLy Pro Va^L Arg Trp jyr Al,a 

b--> 387 @; \» l^o^up }p!Tt 

389 Arg Phe He Gly Thr Tyr Phe Glv Trp Arg Glu Gly Leu Le,u Leu 



;--> 



2 

E 



- - > 



- - > 
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RAW SEQUENCE LISTING DATE: 10/14/2005 

PATENT APPLICATION: OS/10/551, 508 TIME: 11:04:23 



E--> 



E 
E 



- -> 



- -> 



!--> 



B 
B 



- - > 



- - > 



B 
E 



- - > 



- - > 



B--> 



Input Set : N:\FANTU\10551508.raw.txt 
Output Set: N:\CRF4\10142005\J551508.raw 



390 
392 
393 
395 
396 
398 
399 
401 
402 
404 
405 
407 
408 
410 
414 
415 
416 
417 
419 
420 
422 
424 
426 
428 
430 
432 
434 
436 
437 
440 
441 
442 
443 
445 
446 
447 
449 
450 
452 
453 
455 
456 
458 
459 
461 
462 
464 
465 
467 



130 



135 



140 



5ft 



Val Val Phe Trp Pro. Leu^Pro Ser lie Leu Ala Ser lie Gin Leu Phe 



Val P^e Gly Thr Trp Leu Pro His Arg Pro Gly His Asp Ala Phe Pro 
Asp Arg His Asn Ala Arg Ser Ser Arg lie Ser Asp Pro Val.Se.r L.eu 
Leu TJir Cvs Phe His Plje Gly Gly Tyr His His Glu His * riis Leu His 
Pro Thr Val Pro Trp Trp Ar£ Leu Pr>o Ser Thr Ar g T l\r Lys Gly Asp. 
rptt Ala 

; *:lj> SEQ ID NO: 7 
<211> LENGTH: 489 
<212> TYPE: DNA 
<213> ORGANISM: crtz gene 
<400> SEQUENCE: 7 

atgaccaatt tcctgatcgt cgtcgccacc gtgctggtga tggagttgac ggcctattcc 
gtccaccgtt ggatcatgca cggccccctg ggctggggct ggcacaagtc ccaccacgag 
gaacacgacc acgcgctgga aaagaacgac ctgtacggcc tggtctttgc ggtgatcgcc 
acggtgctgt tcacggtggg ctggatctgg gcgccggtcc tgtggtggat cgctttgggc 
atgaccgtct atgggctgat ctatttcgtc ctgcatgacg ggctggttca tcagcgctgg 
ccgttccgct atatcccgcg caagggctat gcccgccgcc tgtatcaggc ccaccgcctg 
caccacgcgg tcgagggacg cgaccattgc gtcagcttcg gcttcatcta tgcgccgccg 
gtcgacaagc tgaagcagga cctgaagacg tcgggcgtgc tgcgggccga ggcgcaggag 




469 

<210> SEQ ID NO: 8 
<211> LENGTH: 162 
<212> TYPE: PRT 

<213> ORGANISM: crtZ amino acid 
<400> SEQUENCE: 8 

Met Thr Asn Phe Leu He Val Val Ala 

1 5 
Thr Ala Tyr Ser Val His Arg Trp He 

20 

Gly Trp His Lys Ser His His Glu Glu 

35 40 
Asn Asp Leu Tyr Gly Leu Val Phe Ala 

50 55 
Thr Val Gly Trp He Trp Ala Pro 

65 70 
Met Thr Val Tyr Gly Leu He Tyr Phe 

85 

His Gin Arg Trp Pro Phe Arg Tyr He 

100 

Arg Leu Tyr Gin Ala His Arg Leu His 



Thr Val Leu 
10 

Met His Gly 
25 

His Asp His 
Val He Ala 



Val Leu Trp Trp 



Val Leu His 
90 

Pro Arg Lys 

105 
His Ala Val 



Val Met Glu Leu 

Pro Leu Gly Trp 

30 

Ala Leu Glu Lys 
45 

Thr Val Leu Phe 
60 

He Ala Leu Gly 
75 

Asp Gly Leu Val 

Gly Tyr Ala Arg 

110 

Glu Gly Arg Asp 





175 




240 



60 
120 
180 
240 
300 
360 
420 
480 



15 



80 



95 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: 03/10/551,508 



DATE: 10/14/2005 
TIME: 11:04:23 



Input Set : N:\FANTU\10551508. raw. txt 
Output Set: N:\CRF4\10142005\J551508.raw 
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E- 


• * > 


468 




115 

• 






120 


125 






470 


His Cys 


Val Ser Phe Gly Phe 


lie 


Tyr Ala Pro Pro 


Val Asp Lys Leu 


E- 




471 


130 






135 




140 






473 


Lys Gin Asp Leu Lys Thr Ser Gly 


Val Leu Arg Ala Glu Ala Gin Glu 


E- 




474 

m f m 


145 


150 




155 






476 


Arg Thr 


















<210> SEQ ID NO: 10 














528 


<211> LENGTH: 386 






* 








D Z J7 


<212> TYPB: PRT 














51fl 

J J u 


<213> ORGANISM: crtY amino acid 










c *3 o 


<400> SEQUENCE: 10 














3 J J 


Val Thr 


His Asp Val Leu 


Leu 


Ala 


Gly Ax a Gly Leu 


Ala Asn Gly Leu 


E- 


■ - > 


C ^ it 


1 


5 






10 








C O £T 
3 JO 


He Ala 


Leu /Via Leu Arg 


Ala 


Ala 


Arg Pro Asp Leu 


Arg Val Leu Leu 


E- 


- > 


c ^ 1 

537 




20 






25 


30 






C*! Q 


Leu Asp 


His Ala Ala Gly 


Pro 


Ser 


Asp Gly His Thr 


Trp Ser Cys His 


E- 


- > 


540 




35 






40 


45 






54 ^ 


Asp Pro 


Asp Leu Ser Pro 


His 


Trp 


Leu Ala Arg Leu 


Lys Pro Leu Arg 


E- 


-> 


C A O 

543 


50 






sm * m m 

55 




60 






54 5 


Arg Ala 


Asn Trp Pro Asp 


Gin 


Glu 


Val Arg Phe Pro 


Arg His Ala Arg 


E- 


- > 


546 






70 






75 






C A Q 


Arg Leu 


Ala Thr Gly Tyr 


Gly 


Ser 


Leu Asp Gly Ala 


Ala Leu Ala Asp 


mm 

E- 


-> 


C A O 

549 




85 






90 








J jl 


Ala Val 


Ala Arg Ser Gly 


Ala 


Glu 


lie Arg Trp Asn 


Ser Asp lie Ala 


K- 


— > 


CO 

552 




100 






105 


110 






3D* 


Leu Leu 


Asp Glu Gin Gly 


Ala 


Tnr 


Leu Ser Cys Gly 


• 

Thr Arg lie Glu 


mm 

E- 


- > 


e c c 
555 




115 






120 


125 






C T 


Ala Gly 


Ala Val Leu Asp 


Gly 


Arg 


Gly Ala Gin Pro 


Ser Arg His Leu 




-> 


ceo 
558 


130 




135 




140 






DO U 


Thr Val Gly Phe Gin Lys 


Phe 


val 


Gly Val Glu He 


Glu Thr Asp Cys 


Ji- 


- > 


5ol 


145 


150 




155 






50J 


Pro His Gly Val Pro Arg 


Pro 


Met 


He Met Asp Ala 


Thr Val Thr Gin 


ff— 


— > 






165 






170 










Gin Asp Gly Tyr Arg Phe 


He 


Tyr 


Leu Leu Pro Phe 


Ser Pro Thr Arg 


is— 


— > 


DO / 




180 






185 


190 








He Leu 


He Glu Asp Thr Arg 


Tyr 


Ser Asp Gly Gly Asn Leu Asp Asp 


E- 


-> 


570 




195 






200 


205 






572 


Asp Ala 


Leu Ala Ala Ala 


Ser 


His 


Asp Tyr Ala Arg Gin Gin Gly Trp 


E- 


-> 


573 


210 




215 




220 






575 


Thr Gly Ala Glu Val Arg Arg 


Glu 


Arg Gly He Leu 


Pro lie Ala Leu 


E- 


-> 


576 


225 


230 




235 






578 


Ala His Asp Ala Ala Gly Phe 


Trp 


Ala Asp His Ala Glu Gly Pro Val 


E- 


- > 


579 




245 






250 








581 


Pro Val Gly Leu Arg Ala 


Gly 


Phe 


Phe His Pro Val 


Thr Gly Tyr Ser 


E- 


- > 


582 




260 






265 


270 






584 


Leu Pro Tyr Ala Ala Gin 


Val 


Ala 


Asp Val Val Ala Gly Leu Ser Gly 


E- 


-> 


585 




275 






280 


285 






587 


Pro Pro Gly Thr Asp Ala 


Leu 


Arg 


Gly Ala He Arg Asp Tyr Ala lie 


E- 


- > 


588 


290 






295 




300 



160 



15 



80 



95 



160 



175 



240 
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590 Asp Arg Ala Arg Arg Asp Arg Phe Leu Arg Leu Leu Asn Arg Met Leu 
E--> 591 305 310 315 320 

593 Phe Arg Gly Cys Ala Pro Asp Arg Arg Tyr Thr Leu Leu Gin Arg Phe 
E--> 594 325 330 335 

596 Tyr Arg Met Pro His Gly Leu lie Glu Arg Phe Tyr Ala Gly Arg Leu 
B--> 597 340 345 350 

599 Ser Val Ala Asp Gin Leu Arg He Val Thr Gly Lys Pro Pro He Pro 
B--> 600 355 360 365 

602 Leu Gly Thr Ala He Arg Cys Leu Pro Glu Arg Pro Leu Leu Lys Glu 
E--> 603 370 375 380 

605 Asn Ala 

606 385 

609 <210> SEQ ID NO: 11 

610 <211> LENGTH: 1506 

611 <212> TYPE: DNA 

612 <213> ORGANISM: crtl gene 

614 <400> SEQUENCE: 11 

615 atgaacgccc attcgcccgc ggccaagacc gccatcgtga tcggcgcagg ctttggcggg 60 
' 617 ctggccctgg ccatccgcct gcagtccgcg ggcatcgcca ccaccctggt cgaggcccgg 120 

619 gacaagcccg gcgggcgcgc ctatgtctgg cacgatcagg gccatgtctt cgacgcgggc 180 
621 ccgaccgtca tcaccgaccc cgatgcgctc aaggagctgt gggcgctgac cgggcaggac 240 
623 atggcgcgcg acgtgacgct gatgccggtg tcgcccttct atcgactgat gtggccgggc 300 
625 gggaaggtct tcgattacgt gaacgaggcc gatcagctgg agcgccagat cgcccagttc 360 
627 aacccggacg acctggaagg ataccgccgc ttccgtgatt acgcggagga ggtgtatcag 420 
629 gagggctacg tcaagctggg caccgtgccc ttcctcaagc tgggccagat gctcaaggcc 480 
631 gcgcccgcgc tgatgaagct ggaggcctat aagtccgtcc atgccaaggt cgcgaccttc 540 
633 atcaaggacc cctatctgcg gcaggcgttt tcgtatcaca cgctgctggt gggcgggaat 600 
635 cccttctcga ccagctcgat ctatgcgctg atccacgcgc tggagcggcg cggcggggtc 660 
637 tggttcgcca agggcggcac caaccagctg gtcgcgggca tggtcgcgct gttcgaacgg 720 
. 639 cttggcggcc agatgatgct gaacgccaag gtcgcccgga tcgagaccga gggcgcgcgg 780 
641 accacgggcg tcaccctggc ggacgggcgg tctttaaggg ccgacatggt cgccagcaac 840 
643 ggcgacgtca tgcacaacta tcgcgacctg ctgggccaca cggcccgcgg gcagagccgc 900 
645 gcgaaatcgc tggaccgcaa gcgctggtcc atgtcgttgt tcgtgctgca tttcggtctg 960 
647 cgcgaggcgc ccaaggacat cgcgcatcac accatcctgt tcggcccccg ctacagggag 1020 
649 ctggtcaacg agatcttcaa gggcccgaag ctggccgagg atttctcgct gtacctgcat 1080 
651 tcgccctgca cgaccgatcc ggacatggcg cctccgggca tgtccacgca ttacgtgctg 1140 
653 gcccccgtgc cgcatctggg ccgcgccgag atcgattggg cggtcgaggg gccgcgctat 1200 
655 gccgaccgca tcctggcgtc cctggaggag cggctgatcc cgaacctgcg cgccaacctg 1260 
657 accacgacgc gcatcttcac gcccgccgat ttcgccagcg aactgaacgc ccatcacggc 1320 
659 agcgccttct cggtcgagcc gatcctgacg caatccgcgt ggttccggcc gcacaaccgc 1380 
661 gacaagacga tccgcaactt ctatctggtc ggcgcgggca cccatccggg cgcgggcatt 1440 
663 ccgggcgtcg tgggctcggc caaggccacg gcccaggtga tgctgtccga cctggcgggc 1500 

E--> 665 gcatga - ■ 9 l^fo 

666 1506 — " 

669 <210> SEQ ID NO: 12 

670 <211> LENGTH: 501 

671 <212> TYPE: PRT 

672 <213> ORGANISM: crtl amino acid 
674 <400> SEQUENCE: 12 
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DAD 




A r"* 

Asa 


&1 a U<i o Cor 

iiia nis oci 


XT L\J 


Al a 


Ala 


E- 


- > 


676 


1 






5 










678 


Gly 


Pne 


Gly Giy Leu 


Aia 


iieu 


Ala 


E- 


-> 


679 






20 












6 81 


Aid 


Tnr 


inr Leu vai 


(jrlu 


Ala 


A y 

Arg 


E- 


- > 


682 






35 












684 


Val 


Trp 


His Asp Gin 


Giy 


his 


vai 


E- 


-> 


685 




50 








55 






687 


Thr 


Asp 


Pro Asp Ala 


Leu 


Lys 


Glu 


E- 


-> 


688 


65 








70 








690 


Met 


Ala 


Arg Asp Val 


Thr 


Leu 


Met 


E- 


-> 


691 








85 










693 


Met 


Trp 


Pro Gly Gly 


Lys 


Val 


Phe 


E- 


-> 


694 






100 












696 


Leu 


Glu 


Arg Gin lie 


Ala 


Gin 


Phe 


E- 


-> 


697 






115 












699 


Arg 


Arg 


Phe Arg Asp 


Tyr 


Ala 


Glu 


E- 


- > 


700 




130 








135 






702 


Lys 


Leu 


Gly Thr Val 


Pro 


Phe 


Leu 


B- 


- > 


703 


145 






150 








705 


Ala 


Pro 


Ala Leu Met 


Lys 


Leu 


Glu 


E- 


-> 


706 






165 










708 


val 


Ala 


Thr Phe lie 


Lys 


Asp 


Pro 


E- 


- > 


709 






180 












711 


His 


Thr 


Leu Leu Val 


Gly Gly 


Asn 


E- 


- > 


712 






195 












714 


Ala 


Leu 


He His Ala 


Leu 


Glu 


Arg 


E- 


-> 


715 




210 








215 






717 


Gly 


Gly 


Thr Asn Gin 


Leu 


Val 


Ala 


B- 


-> 


718 


225 






230 








720 


Leu 


Gly 


Gly Gin Met 


Met 


Leu 


* 

Asn 


E- 


- > 


721 






245 










723 


Glu 


Gly 


Ala Arg Thr Thr Gly 


Val 


E- 


- > 


724 






260 












726 


Arg 


Ala 


Asp Met Val 


Ala 


Ser 


Asn 


B- 


- > 


727 






275 












729 


Asp 


Leu 


Leu Gly His 


Thr 


Ala 


Arg 


E- 


-> 


730 




290 






295 






732 


Asp 


Arg 


Lye Arg Trp 


Ser 


Met 


Ser 


E- 


-> 


733 


305 






310 








735 


Arg 


Glu 


Ala Pro Lys Asp 


He 


Ala 


E- 


- > 


736 








125 










738 


Arg 


Tyr 


Arg Glu Leu 


Val 


Asn 


Glu 


E- 


-> 


739 






340 












741 


Glu 


Asp 


Phe Ser Leu Tyr Leu 


His 


E- 


- > 


742 






355 












744 


Met 


Ala 


Pro Pro Gly Met 


Ser 


Thr 


E- 


-> 


745 




370 








375 






747 


His 


Leu 


Gly Arg Ala 


Glu 


He 


Asp 
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Lys Thr Ala He Val He 

10 

lie Arg Leu Gin Ser Ala 
25 

Asp Lys Pro Gly Gly Arg 
40 

Phe Asp Ala Gly Pro Thr 

60 

Leu Trp Ala Leu Thr Gly 

75 

Pro Val Ser Pro Phe Tyr 

90 

Asp Tyr Val Asn Glu Ala 
105 

Asn Pro Asp Asp Leu Glu 
120 

Glu Val Tyr Gin Glu Gly 

140 

Lys Leu Gly Gin Met Leu 

155 

Ala Tyr Lys Ser Val His 

170 

Tyr Leu Arg Gin Ala Phe 
185 

Pro Phe Ser Thr Ser Ser 
200 

Arg Gly Gly Val Trp Phe 

220 

Gly Met Val Ala Leu Phe 

235 

Ala Lys Val Ala Arg He 

250 

Thr Leu Ala Asp Gly Arg 
265 

Gly Asp Val Met His Asn 
280 

Gly Gin Ser Arg Ala Lys 

300 

Leu Phe Val Leu His Phe 

315 

His His Thr lie Leu Phe 

330 

lie Phe Lys Gly Pro Lys 
345 

Ser Pro Cys Thr Thr Asp 
360 

His Tyr Val Leu Ala Pro 

380 

Trp Ala Val Glu Gly Pro 



Gly Ala 

Gly He 

30 
Ala Tyr 

45 
Val He 

Gin Asp 

Arg Leu 

Asp Gin 

110 
Gly Tyr 
125 

Tyr Val 

Lys Ala 

Ala Lys 

Ser Tyr 

190 
He Tyr 
205 

Ala Lys 

Glu Arg 

Glu Thr 

Ser Leu 

270 
Tyr Arg 
285 

Ser Leu 

Gly Leu 

Gly Pro 

Leu Ala 
350 

Pro Asp 

365 

Val Pro 
Arg Tyr 



15 



80 



95 



160 



175 



240 



255 
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E--> 748 385 390 395 400 

750 Ala Asp Arg He Leu Ala Ser Leu Glu Glu Arg Leu He Pro Asn Leu 
B--> 751 405 410 415 

753 Arg Ala Asn Leu Thr Thr Thr Arg He Phe Thr Pro Ala Asp Phe Ala 
B --> 754 420 425 430 

756 Ser Glu Leu Asn Ala His His Gly Ser Ala Phe Ser Val Glu Pro lie 
E--> 757 435 440 445 

759 Leu Thr Gin Ser Ala Trp Phe Arg Pro His Asn Arg Asp Lys Thr He 
B--> 760 450 455 460 

762 Arg Asn Phe Tyr Leu Val Gly Ala Gly Thr His Pro Gly Ala Gly He 
B--> 763 465 470 475 480 

765 Pro Gly Val Val Gly Ser Ala Lys Ala Thr Ala Gin Val Met Leu Ser 
E--> 766 485 490 495 

768 Asp Leu Ala Gly Ala 
E--> 769 500 

772 <210> SEQ ID NO: 13 

773 <211> LENGTH: 915 

774 <212> TYPE: DNA 

775'<213> ORGANISM: crtB gene 

777 <400> SEQUENCE: 13 

778 atgagcgatc tggtcctgac ctcgaccgag gcgatcaccc aagggtcgca aagctttgcc 60 

760 acggcggcca agctgatgcc gccgggcatc cgcgacgaca cggtgatgct ctatgcctgg 120 
782 tgccgccacg cggatgacgt gatcgacggt caggccctgg gcagccgccc cgaggcggtg 180 
784 aacgacccgc aggcgcggct ggacggcctg cgcgtcgaca cgctggcggc cctgcagggc 240 
786 gacggtccgg tgaccccgcc ctttgccgcg ctgcgcgcgg tggcgcggcg gcatgatttc 3 00 
788 ccgcaggcct ggcccatgga cctgatcgaa ggcttcgcga tggatgtcga ggcgcgcgac 360 
790 tatcgcacgc tggatgacgt gctggaatat tcctatcacg tcgcaggcat cgtcggcgtg 420 
792 atgatggccc gcgtgatggg cgtgcgcgac gatcctgtcc tggaccgcgc ctgcgacctg 480 
794 gggctggcgt tccagctgac caacatcgcg cgcgacgtga tcgacgatgc gcgcatcggg 540 
796 cggtgctatc tgccggggga ctggctggac caggcgggcg cgcggatcga cgggccggtg 600 
798 ccgtcgccgg agctgtacac agtgatcctc cggctgttgg atgaggcgga accctattac 660 
800 gcgtcggcgc gggtgggtct ggcggatctg ccaccgcgct gcgcctggtc catcgccgcc 720 
802 gcgctacgga tctatcgcgc catcgggctg cgcatccgca agagcgggcc gcaggcctat 780 
804 cgccagcgga tcagcacgtc caaggctgcc aagatcggcc tgctgggcgt cgggggctgg 840 
806 gatgtcgcgc gatcacgcct gccgggggcg ggcgtgtcgc ggcagggcct ctggacccgg 900 

E--> 808 ccgcatcacg tctag 

809 915> ■ 

812 <210> SEQ ID NO: 14 

813 <211> LENGTH: 304 

814 <212> TYPE: PRT 

815 <213> ORGANISM: crtB amino acid 

817 <400> SEQUENCE: 14 

818 Met Ser Asp Leu Val Leu Thr Ser Thr Glu Ala lie Thr Gin Gly Ser 
E--> 819 15 10 15 

821 Gin Ser Phe Ala Thr Ala Ala Lys Leu Met Pro Pro Gly He Arg Asp 
E--> 822 20 25 30 

824 Asp Thr Val Met Leu Tyr Ala Trp Cys Arg His Ala Asp Asp Val lie 
E--> 825 35 40 45 

827 Asp Gly Gin Ala Leu Gly Ser Arg Pro Glu Ala Val Asn Asp Pro Gin 



n 
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E- 


-> 


n *t n 

828 




50 




55 


60 








O 1 f\ 

830 


Ala 


Arg 


Leu Asp Gly Leu Arg Val 


Asp Thr Leu Ala Ala Leu 


Gin Gly 


E- 


- > 


831 


65 




WW f\ 

70 




mm mm 

75 








rt "5 T 


Asp 


Gly 


Pro Val Tnr Pro pro 


Pne 


Ala Ala Leu Arg Ala Val 


Ala Arg 


B- 


-> 


834 






85 




90 


• 






836 


Arg 


HIS 


Asp Phe Pro Gin Ala 


Trp 


Pro Met Asp Leu He Glu 


Gly Phe 


E- 


- > 


837 






100 




105 


110 






839 


Ala 


Met 


Asp Val Glu Ala Arg 


Asp 


Tyr Arg Thr Leu Asp Asp 


Val Leu 


E- 


-> 


840 






115 




120 


125 






842 


Glu 


Tyr 


Ser Tyr His Val Ala 


Gly 


^^fc m WM* ^fc ^fe WW- WW, A A Wa. ^w 

He Val Gly Val Met Met 


Ala Arg 


B- 


- > 


843 




130 




135 


140 








845 


Val 


Met 


Gly Val Arg Asp Asp 


Pro 


Val Leu Asp Arg Ala Cys 


Asp Leu 


E- 


-> 


846 


145 




150 




155 








648 


Gly 


Leu 


Ala Phe Gin Leu Thr 


Asn 


He Ala Arg Asp Val He 


Asp Asp 


E- 


- > 


849 






165 




170 








851 


Ala 


Arg 


He Gly Arg Cys Tyr 


Leu 


Pro Gly Asp Trp Leu Asp 


Gin Ala 


E- 


- > 


852 






180 




185 


190 






854 


Gly 


Ala 


Arg lie Asp Gly Pro 


Val 


Pro Ser Pro Glu Leu Tyr 


Thr Val 


E- 


-> 


855 






195 




200 


205 






857 


lie 


Leu 


Arg Leu Leu Asp Glu 


Ala 


Glu Pro Tyr Tyr Ala Ser 


Ala Arg 


E- 


- > 


858 




210 


215 


220 








860 


Val 


Gly Leu Ala Asp Leu Pro 


Pro 


Arg Cys Ala Trp Ser He 


Ala Ala 


E- 


- > 


861 


225 




230 




235 








863 


Ala 


Leu Arg He Tyr Arg Ala 


He 


Gly Leu Arg He Arg Lys 


Ser Gly 


B- 


- > 


864 






245 




250 






• 


866 


Pro 


Gin Ala Tyr Arg Gin Arg 


He 


Ser Thr Ser Lys Ala Ala 


Lys He 


E- 


-> 


867 






260 




265 


270 






869 


Gly 


Leu Leu Gly Val Gly Gly 


Trp 


Asp Val Ala Arg Ser Arg 


Leu Pro 


B- 


-> 


870 






275 




280 


285 






872 


Gly 


Ala Gly Val Ser Arg Gin 


Gly 


Leu Trp Thr Arg Pro His 


His Val 


E- 


-> 


873 




290 


295 


300 


• 






915 


<210> SEQ ID NO: 16 




• 








916 


<211> LENGTH: 293 






• 






917 


<212> TYPE: PRT 












918 


<213> ORGANISM: crtE amino acid 










920 


<400> SEQUENCE: 16 












921 


Met Arg Arg Asp Val Asn Pro 


He 


His Ala Thr Leu Leu Gin 


Thr Arg 


E- 


-> 


922 


1 




5 




10 








924 


Leu 


Glu 


Glu He Ala Gin Gly 


Phe 


Gly Ala Val Ser Gin Pro 


Leu Gly 


E- 


- > 


925 






20 




25 


30 






927 


Ala 


Ala 


Met Ser His Gly Ala 


Leu 


Ser Ser Gly Arg Arg Phe 


Arg Gly 


E- 


- > 


928 






35 




40 


45 






930 


Met 


Leu 


Met Leu Leu Ala Ala 


Glu 


Ala Ser Gly Gly Val Cys 


Asp Thr 


E- 


- > 


931 




50 




55 


60 








933 


He 


Val Asp Ala Ala Cys Ala 


Val 


Glu Met Val His Ala Ala 


Ser Leu 


B- 


-> 


934 


65 




70 




75 








936 


He 


Phe Asp Asp Leu Pro Cys 


Met 


Asp Asp Ala Gly Leu Arg 


Arg Gly 


E- 


- > 


937 






85 




90 








939 


Arg 


Pro 


Ala Thr His Val Ala 


His 


Gly Glu Ser Arg Ala Val 


Leu Gly 



95 



160 



175 



240 



255 





A 



15 



80 



95 
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* 

B--> 940 100 105 110 

942 Gly lie Ala Leu He Thr Glu Ala Met Ala Leu Leu Ala Gly Ala Arg 
E--> 943 115 120 125 

945 Gly Ala Ser Gly Thr Val Arg Ala Gin Leu Val Arg He Leu Ser Arg 
E--> 946 130 135 140 

948 Ser Leu Gly Pro Gin Gly Leu Cys Ala Gly Gin Asp Leu Asp Leu His 
B--> 949 145 150 155 

951 Ala Ala Lys Asn Gly Ala Gly Val Glu Gin Glu Gin Asp Leu Lys Thr 
E--> 952 165 170 175 

954 Gly Val Leu Phe He Ala Gly Leu Glu Met Leu Ala Val He Lys Glu 
B--> 955 180 185 190 

957 Phe Asp Ala Glu Glu Gin Thr Gin Met He Asp Phe Gly Arg Gin Leu 
B--> 958 195 200 205 

960 Gly Arg Val Phe Gin Ser Tyr Asp Asp Leu Leu Asp Val Val Gly Asp 
B--> 961 210 215 220 

963 Gin Ala Ala Leu Gly Lys Asp Thr Gly Arg Asp Ala Ala Ala Pro Gly 
E--> 964 225 230 235 

966 Pro Arg Arg Gly Leu Leu Ala Val Ser Asp Leu Gin Asn Val Ser Arg 
E--> 967 245 250 255 

969 His Tyr Glu Ala Ser Arg Ala Gin Leu Asp Ala Met Leu Arg Ser Lys 
B--> 970 260 265 270 

972 Arg Leu Gin Ala Pro Glu He Ala Ala Leu Leu Glu Arg Val Leu Pro 
E--> 973 275 280 285 

975 Tyr Ala Ala Arg Ala 

976 290 
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VERIFICATION SUMMARY DATE: 10/14/2005 

PATENT APPLICATION: US/10/551, 508 TIME: 11:04:24 

Input Set : N:\FANTU\10551508.raw.txt 
Output Set: N:\CRF4\10142005\J551508.raw 

L:13 M:270 C: Current Application Number differs, Replaced Current Application No 
L:13 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:106 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:1454 SEQ:3 
L:355 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:729 SEQ: 
L:366 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:6 
M:332 Repeated in SeqNo=6 

L:436 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:489 SEQ : \ 
L:447 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:8 - 
M:332 Repeated in SeqNo=8 

L:534 M-.332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 10*/ 
M:332 Repeated in SeqNo=10 

L:665 M:254 E: No . of Bases conflict, LENGTH : Input : 0 Counted: 1506 SEQ: 11 
L:676 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 12^/ 
M:332 Repeated in SeqNo»12 

Lt808 M:254 E: No. of Bases conflict, LE>?GTH : Input : 0 Counted: 915 SEQ: 13 
L:819 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 14 
M:332 Repeated in SeqNo=14 

L:922 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 16 
M:332 Repeated in SeqNo=16 
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